A RIEMANNIAN VIEW ON SHAPE OPTIMIZATION 

VOLKER H. SCHULZ* 

Abstract. Shape optimization based on the shape calculus is numerically mostly performed 
by means of steepest descent methods. This paper provides a novel framework to analyze shape- 
Newton optimization methods by exploiting a Riemannian perspective. A Riemannian shape 
Hessian is defined yielding often sought properties like symmetry and quadratic convergence for 
Newton optimization methods. 

(N 

T— I 1. Introduction. Shape optimization is a vivid field of research. In particular, 

^D the usage of shape calculus for practical applications has increased steadily |H El 01 

^^ [13 mi [ini HSl HI] . Standard references for shape optimization based on the shape 

H calculus are [9l |2ll El] . The shape Hessian is already used as a means to accelerate 

u gradient based shape optimization methods [El [151 113 [22] ■ It is also used for 

^5 characterizing the well-posedness and sufficient optimality conditions [T^l HI] in 

r — particular applications and is reformulated in the framework of differential forms 

in [19_ . A fairly general framework for descent methods for shape optimization is 
U~~P presented in [TS] , 

^^ However, a general framework for the analysis of Newton-type shape optimiza- 

\„/ tion algorithms is still missing. A major reason is the lack of symmetry [Dl [T^ of 

r^ the shape Hessian, as it is commonly defined. In this paper, an attempt is made to 

cast shape optimization problems in the framework of optimization on Riemannian 
manifolds. There is a fairly large amount of publications available on the issue of 
optimization on Riemannian manifolds — mainly for matrix manifolds as in [I] [2S] . 
It is proposed in this paper, to view the set of all shapes as a Riemannian 
manifold and follow there the ground breaking work in j^H JUl [121 lEl \S\ ■ The result- 
ing manifold is an abstract infinite dimensional manifold, in contrast to the finite 
dimensional submanifolds of R" that arise in optimization on matrix manifolds. 
^^ Therefore, distance concepts have to be reviewed and used with somewhat more 

1— ( care. The key observation of this paper is that the action of an element of the 

^^ tangent space of the manifolds of shapes can be interpreted as the shape derivative 

(^ of classical shape calculus. Once this link is established, the concept of Riemannian 

CN shape Hessian, shape Taylor series, shape Newton convergence and sufficient shape 

^~^ optimality conditions follow quite naturally. 

^ In section [2] the notation for the manifold of shapes is introduced and for a 

particular example of a Riemannian geometry, the correspondence between Rie- 
, ; mannian geometry and shape calculus is established. The key element of the co- 

C^ variant derivative is rephrased in terms of the shape calculus. The Riemannian 

shape Hessian is defined and the Riemannian shape Taylor series formulated. Sec- 
tion [s] presents a generalization of the Newton convergence theory, established in [7] 
for linear spaces, on Riemannian manifolds. From that, convergence properties of 
variants of Newton's method on Riemanian manifolds follow immediately. Finally, 
section |4] discusses numerical experiments for shape optimization algorithms with 
linear and quadratic convergence properties. 

2. Riemannian Shape Geometry and the Shape Calculus. The purpose 
of this section is to demonstrate the possibility to define a Riemannian metric on 
the manifold of all possible shapes with a relation to the shape calculus. Since this 
point of view is new, we use the established framework of differential geometry for 
shapes which are C°° embeddings of the unit sphere. Of course, this framework has 
to be generalized for specific applications. However, the purpose of this paper is to 
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convey a new point of view rather than the impact on apphcations. Therefore, we 
assume in the interest of simphcity of the presentation maximum smoothness and 
restrict ourself to only 2D problems. 

It is assumed that the reader is familiar with the basic concepts of Riemannian 
geometry as they are, e.g.,: geodesies, exponential mapping, parallel transport. In 
[m[5T], a geometric structure of two-dimensional C°°-shapes has been introduced 
and consequently generalized to shapes in higher dimension in [231 ini [S] . Essentially, 
closed curves (and closed higher dimensional surfaces) are identified with mappings 
of the unit sphere to any shape under consideration. In two dimensions, this can be 
naturally motivated by the Riemannian mapping theorem. In this paper, we focus 
on two-dimensional shapes as subsets of R^ for ease of discussion, but mention the 
other publications above, in order to indicate that natural extensions of this paper 
to higher dimensional surfaces are conceivable. 

Here, we mean with "shape" a singly connected and compact subset of fi C R^ 
with fi ^ and C°° boundary dfl. As always in shape optimization, the boundary 
of the shape is all that matters. Thus, we can identify the set of all shapes with the 
set of all those boundaries. In [52], this set is characterized by 

Be{S\Wi^) := Emb(S'\R^)/Diff(5^) 

i.e., as the set of all equivalence classes of C°° embeddings of S-^ in the plane 
(Emb(S'^,R^)), where the equivalence relation is defined by the set of all C°° re- 
parameterizations, i.e., diffeomorphisms of S^ into itself (Diff(S'^)). The set B^ is 
considered as a manifold in [22^ and various Riemannian metrics are investigated. 
A particular point on the manifold Be{S^ ,Wi^) is represented by a curve c : S^ 3 
6 K^ c{9) E R^. Because of the equivalence relation (Diff(S'^)), the tangent space is 
isomorphic to the set of all normal C°° vector fields along c, i.e. 

TcBe = {h\h = an,ae C°°(S'\R)} 

where n is the unit exterior normal field of the shape fl defined by the boundary 
dfl = c such that n{9) _L ce(0) for all G S^. For our discussion, we pick among 
the other metrics discussed in [55] the metric family ioi A> 

G-^ : TcBe X TcBe -^ R 

{h,k)^ f {l + AK,{ef){hi9),k{0))\\cg{9)\\d9 

where Kc denotes the curvature of the curve c and (., .) and ||.|| mean the standard 
Euclidian scalar product and norm in R^. This does by no means mean that this 
metric is uniquely defined by the purpose of establishing a connection with the shape 
calculus. There are probably many more Riemannian metrics available which can 
be used in a similar way. li h = an and k = /3n, then this scalar product on T^B,, 
can be expressed more simple as 

G^{h,k) = ( {l + AnDaPds 
Jan 

where ds is the length measure on dfl = c. In ^22j it is shown that for ^ > 
the scalar product G defines a Riemannian metric on Bg and that the resulting 
Riemannian manifold {Be,G^) is complete. Thus, geodesies can be used to mea- 
sure distances and Cauchy sequences in terms of the geodesic distance converge. 
Unfortunately, this is not the case for the most simple member G" of the metric 
family G'^, where A = 0. An illustrative counter-example is given in [22]. In sec- 
tion ^ we use the fact that (B^jG"^) is a Riemannian manifold extensively and 



exploit the existence of the exponential map (exp) according to the usual definition 
in Riemannian geometry. 

Now, we want to analyze the correlation of the Riemannian geometry on B^ 
with shape optimization. In 2D shape optimization, one searches for the solution 
of optimization problems of the form 

min/(i7) 

where / is a real valued shape differentiable objective function. Often, the problem 
formulation involves explicit constraints in the form of differential equations and 
additional state variables asin[lllll[ini[2Hllini[2i[IZl[IllISl[ni[iZllSl[21l3Il 
im H] I but for the sake of ease of presentation, we can assume all those possible 
additional structures are contained implicitly within the mapping /. The shape 
derivative of / is a directional derivative in the direction of a vector C°° vector 
field T^ : R^ — >• R^ which can be represented on the boundary according to the 
Hadamard structure theorem [31] as 

df in)[V]= f g{V,n)ds, g:dn^n 

Jon 

If V\dn ~ an, this can be written more concisely as 

dfin)[V]= [gads 
Jdn 

In Riemannian geometry, tangential vectors are considered as directional derivatives 
of scalar valued functions. Since curves c G B^, can be interpreted as boundaries 
of domains fl with boundary c — 951, we can consider every scalar valued function 
f : c = dfl i-> R also as mapping / : il i-^ R. Thus, we see that the action of a 
tangent vector h g TcB^ on a scalar valued function / : Bg — >■ R can be interpreted 
in the shape calculus, via the unique identification of the boundary c — dfl with its 
shape ri, as the shape derivative of / with respect to an arbitrary C°° extension V 
of h in the whole domain fl with V\gfi = h. Thus, we can write 

h{f){c)^df{n)[V]= [gads 

ii h — an. Also, the gradient in terms of a Riemannian representation of the shape 
derivative in terms of the metric G can be written as 

The essential operation in Riemannian geometry is the covariant derivative V^A: 
which is a directional derivative of vector fields in terms of tangential vectors such 
that V/ifc G T^B^, if h,k € T^B^. Usually, the covariant derivative is written in 
terms of the Christoffel symbols. In [22^ explicit expressions for the Christoffel 
symbols are derived in terms of the Riemannian metric G . However, in order to 
reveal the relation with the shape calculus, we show another representation of the 
covariant derivative in theorem 12.11 

Theorem 2.1. Let fl e R^ 6e a shape and V,W e C°°(R2,R2) vector fields 
with orthogonal components a := {V\gQ,n), (3 := {W\Q^,n) such that h := an, 
k := j3n belong to the tangent space of Be- Then, the covariant derivative can he 
expressed as 

dB 1 2Atc' 

VyW := Vhk = -^a + -(kc + JT^^)"^ + ^«c(a/3)rr 

1 2Ak^ 

= {DW V, n) + -(^c + J— 4^) {y^ ^) {W, n) + Ak^HV, n) {W, n))rr 



where expressions "{■)tt " raean second order derivative in unit tangential direction 
of the shape boundary. 

Proof. The expression in theorem |2.1| is derived from the product rule for 
Riemannian connections, hG^{k,(.) = G^{Vhk,i) + G^{k,Vht.) for any (. e TcBe. 
We assume that Z e C°°(R2,R2) is a vector field with £ -.^ Z\an ^ -/n. Then 

h{G^{k, i)) =d( f (1 + AKl)l3jds) [V] (2.1) 

\Jdn J 

on on 

2AK^{Kla + arr)h + {'^ + AKl)^-ia (2.3) 

dn on 

+ (1 + Ak^)— ^/3q: + Kc(l + AKl)al3-ids 

QR 

2AKla + 2AKca{l3j)rr + (1 + AnDj^ja (2.4) 

dn on 

+ (1 + AnD^fia + Kc{1 + AnDafijds 
on 



where equation (2.4 1 is an immediate consequence of the discussion in section 2.2 
of |22j . Since the remaining properties of a Riemannian connection (C°° linearity, 
the Leibniz rule and the symmetry) are obviously satisfied, the expression above is 
shown due to the uniqueness of the Riemannian connection. D 

The Riemannian connection now gives the means to investigate the Hessian 
of an objective defined on a shape, resp. its boundary. As in [T], we define the 
Riemannian Hessian of a function / at the point c G i?e as the linear mapping of 
TcBe to itself defined by 

Ress f{c)[h] := V/igrad/ 

In the terminology of shape optimization and with the definition of vector fields as 
in theorem |2.1| and the identification of the boundary c = dfl with its shape, we 
may identify this with a now so-called Riemannian shape Hessian. 

Definition 2.2. For the setting of theorem \2.1\ we define the Riemannian 
shape Hessian as 

Hess/(f])[y] :=Vygrad/ 

The next theorem gives a correlation of the covariant shape Hessian with the stan- 
dard shape Hessian which is defined by repeated shape differentiation. 
Theorem 2.3. The Riemannian shape Hessian satisfies the relation 

G{Hessf{n)[VlW) = d{df{^)[W])[V] - df{n)[VvW] 



where V, W are defined as in theorem 2.1 and d{df {yi)\W])\V] denotes the standard 
shape Hessian as defined in |^. 

Proof. The Riemannian product rule gives 

ft(G(grad/, k)) = G(V;,grad/, k) + G(grad/, V ^k) 

We observe 

/i(G(grad/,fc)) = h{k{f)) = h{df{n)[k]) = d{df{n)[k])[h] = d{df{n)[v])[w] 

G(V;,grad/, fc) - G(Hess/(c)[/^],fc) = G(Hess/(f^)[F], W^) 
G(grad/, V,,fc) = Vhk{f) = df{^)[Vhk] = df{n)[VvW] 



D 

Corollary 2.4. The Riemannian shape Hessian is symmetric, i.e. 

G{Hessf{n)[V],W) = G{V,Hessf{n)[W]) 

Proof. The symmetry of the Riem.annian connection V gives with the vector 
field definitions of theorem 12.11 



Giliessfin)[V],W) - hikif)) - (V,fc)(/) = kihif)) - {Vkh){f) 
= G(y,Hcss/(f7)[Ty]) 

D 

Now we can use the Riemannian shape Hessian in order to formulate a Taylor 
series expansion as well as optimality conditions. Since the subsequent theorem 
holds in more general cases than just for B,,, we formulate it in a more general 
notation. 

Theorem 2.5. Let {J\f, G) he a Riemannian manifold with metric G and norm 
\\.\\ := G(.,.). The set U d M be a convex subset of M. We consider all x Cz M 
and Ax G T^Af with x,exp^(Aa:) € U and denote the parallel transport along the 
geodesic 7 : [0, 1] -^ A/", i H> 7(t) := exp^(iAa::) by Vaj3 : T^(q)A/' -> T^{fi)N . We 
assume for the Riemannian Hessian of a function f : U ^>^ M. the following Lipschitz 
property at x (z M : 

WVifiHess f{expJAx))VQ,i - Hess f{x)\\ < L\\Ax\\ , Vexp^(Aa;) e U 

with a constant L < 00. Then, we achieve the estimation 

\f{exp^{Ax))~f{x) + Gigradf{x),Ax) + lG{Hessf{x)Ax,Ax)\<hAxf 

2 D 

Proof Let us consider the mapping ip : [0,1] 3 t 1-^ /(exp^(iAx)). We note 
that for all differentiable functions and in particular for ip yields 
1 t 

ip"{s) ~ ip"iO)dsdt =. v^(l) - ^(0) - ^'(0) - ^^"(0) 

Since 

ip{l) = /(exp,(Aa:)) , ^(0) = f{x) , ^'(0) = G(grad/(x), Ax) 
(^"(0) = G{Hessfix)Ax,Ax) 

we observe 

|/(exp,(Ax)) - fix)+G{gmdfix),Ax) + ^G [Hess fix) Ax, Ax) \ 
1 t 



ip"{s)-ip"{0)\dsdt 
/ / \{{'PsfiHessf{exp^{sAx))'Po,s-Hessf{x))Ax,Ax)\dsdt 




1 t 




1 t 



< / \\'Ps.flHessf{exp^{sAx))ro,s~Hessf{x)\\\\Ax\\^dsdt 




1 t 

< I I sL\\Axfdsdt^ §llAxf 
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D 

Now, we can exploit the Taylor expansion of Theorem |2.5| for necessary and 
sufficient optimality conditions 



Theorem 2.6. Under the assumptions of Theorem 2.5 we obtain: 

(a) If X is an optimal solution, then Hess f{x) > 0, i.e. (Hess f{x)h,h) > 0, 
for all h e T^TV 

(b) If X satisfies grad f{x) = 0, and Hess f{x) is coercive, i.e. [Hess f{x)h,h) > 
c||/i|p, for all h G T^Af and for some c > 0, then x is a local minimum, 
provided Hessf{x) satisfies a Lipschitz condition as in Theorem \2.[^ 

Proof. The proof is identical with the standard proof in linear spaces. D 

Example 1. Now let us study a particular example, one of the simplest but 
nevertheless instructive shape optimization problems 

min f(fl) :— / ^p(x)dx 

where V" ■ K-^ — >■ R is a sufficiently smooth scalar valued function. We use the 
vector field definitions of theorem |2.f| The shape derivative of this objective and 
the G^-gradient are 

df{n)[V]^ f ^{V,n)ds 
Jon 

± 
1 + Ak 



grad/(c) = ^ , ^^.^ , c^dVl 



The shape Hessian is computed by shape differentiating the shape derivative 

d{df{n)[W][V])) ^d{j ^Pkds)[h] = f ^^^0^h + K^ipkhds 
Jan Jdn 'J^ 

-^r^kh + i^^r^h + Kcipkhds 
ion on dn 

(^ + Hci') {W, n) (V, n) + V^ (DW V, n) ds 
an on 

We observe that the shape Hessian is not symmetric and we are tempted to use the 
term "9'0/i9n + Kc'0" within a Newton-like optimization strategy, since we do not 
trust the nonsymmetric part of the Hessian. In contrast to that, the Riemannian 
shape Hessian computed by application of theorem |2.3| is given by 



G(ilessf{n)[V], W) = d{df{n)[W])[V] - df{n)[VvW] 

(^ + Hci^) {W, n) {V, n}+'ip {DW V,n)ds- [ ^ (Vy H/, n) ds 
an on Jon 

& + Hci^) {W, ri) {V, ri)+^ {DW V, n) ds 
an on 

ij{{DWV,n} + |(«, + ^^) {V,ri) {W,n} 
/an 2 l + Ani 

+ i:AKc{{V,n) {W,n))rrds 

(S + ^V' - T^^V') {V,ri) {W,n) - M^c{{V,n) {W,n)),rds 
an on 2 1 + AKi 

Now, as already abstractly shown in corollary |2.4[ we observe symmetry of the 
Riemannian shape Hessian also in this example. For small A > the term "5'0/9n-|- 
^ip" is decisive rather than the putative term above. We will study this example 
in more specific details in section |4J 



3. Convergence of Riemannian Shape Newton Methods. Now we for- 
mulate a contraction result for Newton iterations on manifolds which is in line with 
linear space theorems in |71 |TT] . Again, since the observations hold for more general 
Riemannian manifolds than just B^,, we use a more general notation. We proof con- 
vergence properties for Newton-like methods on Riemannian manifolds, but have 
always in mind that in this paper we want to solve a particular root finding problem 
for the gradient of an objective. Thus, every time the Jacobian is mentioned, we 
can think it as the Jacobian of the gradient and thus the Hessian of an objective 
on a Riemannian manifold. 

Theorem 3.1. We consider a complete Riemannian Manifold {Af,G) with 
norm \\.\\ := G(., .) on the tangential bundle TM . The set D d J\f is assumed to be 
singly connected and open. We are seeking a singular point of the twice differentiable 
vector field _F : A/" — > TAf by employing a Newton method on manifolds. The 
symbol J :— DF denotes the covariant derivative of F such that DF{x)v := VyF^. 
Furthermore, we assume that for all points x,y d D with y = expj.(Aa:). Ax :— 
—AI(x)F{x) and M{x) € Fiiid{TxAf)) and invertible and all t € [0, 1], the following 
Lipschitz conditions are satisfied along the geodesic 7 : [0, 1] — > M , t 1— > 7(i) := 
e'xp,j.{~tM{x)F{x)) with parallel transport Va,;} '. T^(a)-^ ~^ '^■yifi)-^ ■ 

(1) There exists lo < 00 with 

\\Miy)iVt,iJiMt))Vo,t ~ Po,iJ{x))Ax)\\ < ut HAxf 

(2) There is a constant upper limit k < 1 for the function k{x) in 

\\M{y)VoAF{x) + J{x)^x)\\ =: k{x) \\/\x\\ 

with k(x) < K. 
If Xq satisfies (Jg < 1, where S^ '.= n + 'w \\M{xk)F{xi;)\\ , fc = 0, 1, . . ., then follows 

(1) The iteration Xk+i '.— exp^ {—M{xk)F{xk)) is well defined and stays in 
Do:^{xeD\ d{x,xo) < ||M(a:o)i^(a;o)ll /(I - So)}. 

(2) There exists x € Dq with lim x^ — x in the sense that lim dcix^ , x) = 0, 

where da is the geodesic distance. 

(3) There holds the a-priori estimation 

dGA{x\x) < -\ IIAxfell < -^ IIAxfcll 

where Axk ■= —M{x^)F{x''). 

(4) There yields the contraction property 

IIAxfe+ill <4||Aa;fc|| = iK+'^\\6xk\\)\\Axk\\ 

(5) If the mapping x M- M{x) is continuous and M{x) is nonsingular, then x 
is not only a fixed point hut rather a root of the equation F{x) — 0. 

Proof. The proof follows the standard lines — now in manifold notation. First, 
we note the manifold variant of the fundamental theorem of calculus along the 
geodesic 7 for any C^ vector field X (cf. [16]): 

t t 

Xi^it)) ^Vo^tX{-/{0)+ J Vs,t^^is)X-,(^s)ds = Vo,tXi-fiO)+ J V,^tDX{j{.s))j{s)ds 




and have in mind that obviously 7(5) ~ Vo,s^x ■ We show the contraction property 
and use the abbreviation R{x'^) :— F(x*'') — J{x'^)M{x^)F{x^) 

||Ax'=+i|| = ||M(x'=+i)F(x'=+i)|| 

= \\M{x^+^)Vo,iR{x'') + M{x^+^)[F{x''+^) - 7'o,ii?(x'=)] || 



1 

M(a;'=+i) j Vt,iDF{-f{t))Vo,tAx'' - Vo^iJ{x'')Ax''dt 



< kWAx'- 



<K||Aa;'=||+ / \\Mix''+^){Vt,iDF{-f{t))Vo^t - Vo,iJ{x''))Ax''\\dt 


1 



Now we conclude inductively that Sk < l for all k and thus the series {(5/c}fcLo ^'^'^ 
{||Aa;fe||}^o 3'™ montonically decreasing. Now, we show that the series {xk}kLo 
stays in Dq. We use the triangle inequality which holds because of the metric 
properties of the Riemannian metric. 



„k ^0\ 

— I I X— I [|-"VIII/\(7. 1 11; i;i-'Vlll/\#T lllf-"!;!, , 

fe-1 fc-1 



< V II Ax-'' II < V f^n l|Ax°|| < ^ (geometric series) (3.2) 

- /L^ II II - A^ u II II - 1 - (5n J \ / 



dG{x',x ) = dG(exp(Axfc_i) o .. .oexp(Axi) oexp3.^(Axo),xo) (3.1) 

|Aa;o|| 
^5 11^-^^' II ^ " 

Analogously, we show the Cauchy property of the series {a^fcl^g- 

dG(a;",a;") = d(exp(Aa;m_i) o . . . o exp(Aa::„+i) o exp3,^(Aa;„),x„) (3.3) 

771—1 777— 1 771—1 

< Y. \\Ax^\\ < ^ S^, \\Ax-\\ < J2 S'o llAxl (3.4) 

/c=7i k—n k—n 

< e for any e > 0, if n, ?7i — )■ 00 (3-5) 

The a priori estimation is yet another application of the triangle inequality. Now, 
if M (.) is continuous and x^ — > i, we pass the defining equation 

Xfc+i := exp^^(-M(xfe)F(a;fc)) 

to the limit, having in mind that exp is continuous and expj,(2;) = y implies z = 
for any y. Thus, we observe that M{x)F(x) = 0. Finally, if M[x) is nonsingular, 
we conclude that F{x) = 
D 

Now, we proof quadratic convergence for the exact Newton method-again com- 
pletely parallel to the discussions in [3 |TT] . 

Corollary 3.2. Together with the assumptions of theorem \3.1\ we choose 
M{x) = DF(x)^^ , thus defining the exact Newton method on manifolds. The re- 
sulting iteration converges locally quadratically, i.e., there is a k GlM and a C < 00 
such that 

dG{x''+^,x)<CdG{x'',xf, yk>k 

where da denotes again the geodesic distance. 

Proof. Because of the choice M{x) — DF{x)~^, we observe for k in theorem 
Ofrom 

\\Miy)roAFi=c) + J{x)Ax)\\ = \\M{y)raA{F{x) ~ Jix)J{xr'F{x))\\ = 



And thus k = 0. Since {Sk}'^^Q is monotonically decreasing to zero, there is k such 
that Sk = ^ ll^^'^ll — i for all k > k, which implies also Sk/{1 — Sk) < 1/3 for all 
k > k. From the fact that the exponential function is an isometry and from the a 
priori estimation in theorem |3.1[ we observe 



llAx'^ll ^dG{x''+\x'')<dG{x''+\x)+dG{x^x)< -^\\Ax''\\+dG{x'',x) 

1 - dk 

< -||Aa;'=||+c^G(a;^^) 

and therefore ||Aa;'^|| < ^d{x'',x). Now, we use again the a priori estimation 

dGix'^+\i) < /^ ||Ax,+i|| < ^^ llAa^i' < LdG(x^x)2 
1 - dk+i 6 2" 2 

D 

In shape optimization, it is very rare that one can get hold of J{x^) (i.e., the 
Hessian of an objective) away from the optimal solution. However, in many cases, 
expressions can be derived, which deliver the exact Hessian, if evaluated at the 
solution. That means that often the situation occurs that an approximation M'^ 
of J{x'') is available with the property A/jT^ — > DF{x),k — > oo. We show local 
superlinear convergence in those cases. 

Corollary 3.3. Together with the assumptions of theorem \3.1\ we choose 
Mk :== M{x'') such that 

\\[M-^ - DF{x'^)]Ax''\\ 

-^ 0, K — ?► oo 



IIAx^ll 

If there is a constant C < oo such that the approximation Mk is uniformly bounded, 

ll-^fcll < C, Vfc, then the resulting Newton iteratio n con verges locally superlinearly. 

Proof. We observe for Kk '■= k.{x^) in theorem 



3.1 



^ \\Mk+iVo,i{F(x'') + J{x^)Ax% ^ ||Affc+i7^o,i(Affc-^ + J(x^))Ax)|| 
'"'" IIAxfeJI " IIAx^ll 

^ \\Mk+i\\\\Vo^,{M-^ + J{x^))Ax)\\ 
\\Ax^ 

We use again the apriori estimation in an analogous fashion as in the proof of 
corollary 3.2 to obtain 

J / k+i -\ ^ 5k+i 1 — 2(5fe , f^,i „, \ — 26k „ , , k+i -\ 
dG{x''+\x) < —dGix^+\x) < —^SkdGix'^+\x) 

1 - Ok+i I- Ok (1 - Okr 

Since < ^^, we observe that (1 — 2Sk)/{l — 5kY < 1 which yields 
dcix'^^^Tx) < SkdG{x'',x) with Sk -^ 0, k -^ oo 



Remarks: 

- If we can assume that the quality of the Hessian approximation satisfies even 
IIM^^ + J(x'')|| < C II Ax'^ll then we observe also quadratic convergence as 
an immediate consequence of an obvious refinement of the proof of corollary 
[331 

- Note that the condition in corollary |3.3| is similar to the Dennis-More con- 
dition [101 [5j , which is also applicable for quasi-Newton update techniques 
on Riemannian manifolds 251. 



10 

4. Numerical Experiments. Here, we study the linear and quadratic con- 
vergence properties of standard optimization algorithms applied to an example for 
shape optimization in R^ which is as simple as possible, but nevertheless reveals 
structures reflecting the discussion of this paper. We consider the following problem: 

min/(f7) = / xl+fix^-ldx, fJ- > I (4.1) 

^^ Jn 

The obvious solution to this optimization problem is the shape fl with boundary 

dn^ixelR^ \ xl+ ii^xl = 1} 

And from the discussion above, we observe that the Riemannian shape Hessian at 
this solution is a multiplication operator 

Hessf{n)V ^v-V , v{s)e[-,2],sedh 

Therefore, the Hessian is coercive and thus the solution is locally unique and the 
Newton iteration should converge quadratically close to the solution. 

The application of the exponential mapping within the Newton method is an 
expensive operation. It is recommendable to replace this step by a so-called retrac- 
tion mapping [T1|32J|1S], i.e., a smooth mapping r^ : T^Be — ?> B,. with the following 
properties: 

a) r^ (0) = x 

b) Drx{Q) = idr^Bc (local rigidity condition [1]) 

with the canonical identification T^T^B^, = T^B^. Properties a) and b) are also 
satisfied by the exponential mapping. An example of this kind of mapping is the 
following mapping that we will use in our implementation of Newton optimization 
methods on B^'. 

r^ : T^Be -^ B^ 

f x + ri: S"! ^ R2 

"^^y e^x{e) + ri{e) 

The implementation of the retraction r^ defined above is much simpler than the 
implementation of the exponential map. However, one should keep in mind that 
this retraction does not leave B^. only for tangent vectors rj which are small enough. 
Now, the Newton method for optimization is generalized to 



X 



fe+i _ 



= r,.{-M^F{x^)) (4.2) 



where the special case r^ = exp^ considered above falls also into this class of 
iterations. Since any retraction is a smooth mapping and because of the local 
rigidity, we observe that 

dG{x,r^{T])) = dG(a;,exp^(7y)) + cdG(2;,exp^(77))2 

for some constant c and for small enough tangential vectors rj. Plugging this into the 



estimations (3.1 , 3.2 1 and (3.3 3.4), we conclude that locally linear, superlinear and 
quadratic convergence properties are not changed by using more general retractions 
rather than the special case of the exponential map. 

The distance of a shape iterate Vt'^ to the optimal shape Vt should be measured by 
the length of a connecting geodesic. However, in order to reduce the numerical effort, 
we exploit the fact that a connecting geodesic is also the result of an exponential 
map, which on the other hand is an isometry. Thus, we use for shapes close enough 



11 

to the optimal solution such that fl = fl + h with h ~ ct^lan ^ second order 
accurate substitute for the geodesic distance the measure 

d{n'',n)^ I \a\ds (4.3) 

i.e., just the absolute value of the area between the iterates and the optimal shape. 



Because of the simplicity of the objective (4.1 ), the integrations for the evalu- 
ations of the objective can be performed in the form of integration along the shape 
boundary in polar coordinates in the following way 



where c^ :— d 



1 
n 



t^Js^ 3 " ^ '"Km 



fl is the boundary of the domain fl scaled by fi in the 



X2-direction and the last integral can be numerically evaluated by usage of the 
trapezoidal rule for a piecewise linear boundary discretization. Similarly, in the line 



of (4.3 ), first order approximations of the distance of curves to the optimal solution, 



which differ only by a small amount are computed by 

J(f}^il)=/ iiic(.)ii-iic(.)iiid. = - / iiicns)ii-ii^'^^ 

Jan fJ-Js^ ¥ 

Furthermore, the tangential (and thus the unit normal) vector field for the iterates 
is computed by central differences. 

The first idea for setting fj, in the numerical experiments is /z = 1. However, 
then Hessf{Cl) — 2 • idx - b^ which means that the method of steepest descent with 
exact line search can exactly mimic the behaviour of the Newton method — including 
quadratic convergence. However, we want to experience the difference between 
steepest descent methods and Newton's method in this example. Therefore, we 
choose fj, = 2 for performing the computations. The initial shape is defined by its 
boundary 9J7° :— c'^ as 

^0 1/ cos(,s)- (3.1511 -sin(2.)|cos(.) \ 

^ ^ 2 V sni(s) - 0.15|1 -cos(2s)|cos(s) J ' l ; j 

just to be somewhat more interesting than a simple circle. We use a piecewise 



linear approximation with 100 equidistant pieces in [0, 27r]. Table 4.1 shows the 
performance of the shape steepest descent method versus the shape Newton method 
with exact line search, where we use A = in the definition of G similarly as in 
[5D]. The iterations are stopped at a distance of less than 10^^ to the solution. 
For the steepest descent method, we observe linear convergence with a factor of 
approximately 0.3, while for the Newton method we see clear quadratic convergence. 
Figure |4] shows the various shape boundaries during the respective iterations. 

5. Conclusions. In this paper, a novel point of view on shape optimization 
is presented — the Riemannian point of view. The Riemannian shape Hessian is in- 
troduced which can serve as a much more useful notion of second shape derivative 
than the classical so-called shape Hessian. With this term, classical optimization 
results, known in linear spaces, can be formulated and proofed in the area of shape 
optimization. Newton optimization methods for shape optimization are analyzed. 
A new model problem for shape optimization is introduced which mimics the prop- 
erties of a standard L^-quadratic model problem in linear spaces — in so far as the 
Riemannian Hessian is just a multiplicative operator. 
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Table 4.1 
Performance of shape algorithms: steepest descent (indices SD) versus Newton (indices NM), 
a denotes the line search parameter chosen by an exact line search applied to the objective. 



It.-No. 


fmo) 


din'^sD^Ci) 


CUSD 


/(f^W) 


m'kM,^) 


OiNM 





-1.7664 


0.9222 


0.51 


-1.7664 


0.9222 


0.64 


1 


-2.0716 


0.2142 


0.31 


-2.0865 


0.1359 


1.00 


2 


-2.0921 


0.5882E-01 


0.37 


-2.0944 


0.4265E-02 


1.00 


3 


-2.0942 


0.1662E-01 


0.32 


-2.0944 


0.1037E-04 


1.00 


4 


-2.0944 


0.4459E-02 


0.35 


-2.0944 


0.2590E-09 




5 


-2.0944 


0.1356E-02 


0.33 








6 


-2.0944 


0.3951E-03 


0.34 








7 


-2.0944 


0.1218E-03 


0.33 








8 


-2.0944 


0.3643E-04 


0.35 








9 


-2.0944 


0.1194E-04 


0.32 








10 


-2.0944 


0.3421E-05 


0.35 








11 


-2.0944 


0.1137E-05 


0.32 








12 


-2.0944 


0.3305E-06 


0.35 








13 


-2.0944 


0.1109E-06 


0.30 








14 


-2.0944 


0.2894E-07 


0.39 








15 


-2.0944 


0.1008E-07 


0.36 








16 


-2.0944 


0.3723E-08 












Fig. 4.1. Visualization of the shape iterates for the steepest descent (left) and for Newton's 
method (right) from initial (blue) to solution (red) shape. 



Several aspects are now open for further research: This paper only deals with 
C°°-boundaries of 2D shapes. Of course, for practical purposes, this is not enough. 
The regularity has to be reduced, one has to go from dimension 2 to 3 and one 
has to treat shapes which are just part of submanifolds of R"^, rather than whole 
embeddings of the unit sphere. Furthermore, the implications for practical shape 
optimization have to be analyzed. This will all be covered in subsequent papers. 
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